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Be it known that I, Anne G. O'Connell, a citizen of the Republic of Ireland, residing at 
3 Woodberry, Carpenterstown Road, Castleknoclc^ Dublin 15, Ireland, have invented new 
and useful improvements in: 

ORGANIZATION OF DATABASES IN NETWORK SWITCHES FOR PACKET- 
BASED DATA COMMUNICATION NETWORKS 



of which the following is a specification: 



ORGANIZATION OF DATABASES IN NETWORK SWITCHES FOR PACKET-BASED 



DATA COMMUNICATION NETWORKS 
Field of the Invention 



This invention relates to network switches for providing controlled access, in accordance with 
network addresses contained in data packets, to a multiplicity of remote stations. 

Background to the Invention 

A network switch which has a multiplicity of ports disposed, for example, for receiving packet 
data from a variety of sources and other ports connected to groups of remote stations by 
intermediate devices, conveniently termed gateways, requires for its efficient operation a 
database by means of which a network address of a network destination and contained within 
an incoming packet is related to a media access control 1 address, for example identifying a 
respective gateway, and an identification, such as masking data, of the port to which the 
intermediate device or gateway is connected. One way to decode incoming network addresses 
to use a hash table, indexed according to hashed addresses and consisting of pointers to entries 
in an associated data table of which the entries comprise a network address, a media access 
control address and a relevant port mask. 

The state of the art is exemplified by the United States patent to Rostoker et al, No. 5708659, 
issued 13 January 1998. That patent describes in considerable detail the organisation and 
detailed operation of a network switch in which predetermined numbers of bits from a packet 
address is selected to use a hash key, which is used to compute a table address. It further 
discloses the comparison of the contents of the table at that address with packet address 
information so that if the compared information matches, the packets may be transmitted over 
the port associated with the particular destination address. Other features of network 
switches, such as the storage of packets in buffer memories and the use of pointers, are fully 
described in the aforementioned patent. The performance of source address and destination 
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address look-ups is also described in the European patent application EP-0594196 published 
27 April 1994. That describes a hybrid system in which searching of a relative large database 
is performed using a combination of programmable hash algorithms, binary search algorithms 
and a content-addressable memory. 



Both network addresses and media access control addresses are normally quite wide, typically 
comprising 32 bits and 48 bits respectively, In practice however when remote stations are 
grouped by connection to a common intermediate device or gateway, the entries in the 
database contain a large degree of redundancy and therefore occupy substantially unnecessary 
space in a storage medium. In particular, where different remote stations share the same 
gateway, the entries in a data table indexed according to the network addresses of the remote 
stations will contain for each of the network addresses in a given group, identical media access 
control addresses and port masks for each of the stations in the group. A further disadvantage 
in known systems is a need to verify the network address relevant to an entry in a hash table 
of pointers. This arises in practice because network addresses of incoming packets are 
commonly reduced (by hashing) in width, so as to become a size matched for accessing the 
hash table. 

Summary of the Invention 

The present invention provides an improved technique for managing a database in a network 
switch of the character described above. The present invention is based on the use of a data 
table containing a media access control address and a port mask, identifying a port connected 
to the intermediate device identified by the media access control address, and a hash table 
which includes the network addresses of the remote stations, the combined effect being a table 
of network addresses, media access control addresses and port numbers. In the use of such 
a technique, the space in the database for a given media access control address will be 
substantially reduced since it is required only to appear once, the hash table entries for the 
remote stations sharing the same media access control address each including a pointer all 
pointing to the single entry in the data table. 
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Brief Description of the Drawings 

Figure 1 illustrates part of a data communication network including a network switch, various 
local stations, gateways and remote stations; 

5 _ 

Figure 2 illustrates a network switch in which the present invention may be used; 

Figure 3 illustrates a known manner of organising a database for the control of the routing of 
signals through the network switch; 

Figure 4 illustrates an improved technique for managing a database for the network switch; 
and 

Figure 5 is a flow diagram of the process for accessing the hash table and data table. 
Detailed Description of a Preferred Embodiment 

As indicated previously, the invention relates to network switches such as routers, which may 
for example function so that users in different sub-nets may communicate. In a typical system, 
a group of users, which share the same portion of a network address, form a sub-net. A group 
of sub-nets can be connected to a 'backbone' or wide area network (WAN) by a network 
switch. A network switch in this context can build up a table of addresses for each sub-net so 
as to manage traffic between the sub-nets. There may be a 'default' route to the WAN from 
each router. Routers may now respond to network (layer 3) addresses as well as MAC 
addresses and may accordingly be termed 'layer 3 f switches. 

Figure 1 illustrates part of a packet-based data communication network comprising a network 
switch 10 which has various ports connected to local stations 3 and, in this embodiment, two 
ports which are respectively connected to intermediate switches, conveniently termed 
30 gateways, which are themselves connected to a respective group of remote stations. In the 
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greatly simplified network shown in Figure 1 , the first gateway 1 is connected to a first group 

4 of remote stations A, B and C, and the second gateway 2 is connected to a second group 

5 of remote stations D and E. The switch is a Mayer 3' switch, which means that in terms of 
current standards relating to packet format and transmission, it can respond to network 
address es suc h as IP (inte rnet pro tocol) addresses for directing packets from, for example, the 
local stations to the port coupled to the intermediate gateway itself coupled to the remote 
station identified by the network address. 

Figure 2 illustrates a network switch which, apart from the organisation of a hash table and 
data table as described hereinafter, is in a generally known form, for example that described 
in the aforementioned Rostoker at al, US patent 5708659. 

The network switch shown in Figure 2 comprises a multiplicity of ports 1 1,12 and 13 as well 
as other ports (not shown) all coupled bidirectionally to a switching ASIC 14, which performs 
the basic switching operations needed to direct packets to memory, to organise look-ups on 
source and destination addresses in a data table, to perform any necessary contention 
resolution (especially if the network is an Ethernet network) and so on. All these functions 
are well known to those skilled in the art and will not be described in detail. Basically, packets 
received by the ports may be temporarily stored in receive buffers associated with each port 
but on transmission from those receive buffers are directed by the switching ASIC and a bus 
18 to a packet buffer 19, preferably constituted by dynamic random access memory. The 
switching ASIC provides for the storage in a pointer table 20 of pointers to the buffers 19 so 
that the packets can be retrieved from the buffer memory and transmitted onwardly by way of, 
the respective port or ports. 

The switching ASIC also causes, in a manner which is generally known in itself, the storage 
of address data and the relevant port mask in a data table 17. Pointers to the entries in the data 
table 17 are held in a hash table 16, which is accessed by hashing the network address from a 
receive packet by means of hashing logic 15. 
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Figure 3 illustrates the manner in which a hash table 16a and an associated data table 17a are 
organised according to current practice. 

In particular, when a packet is received by the switch 10 from any of the local stations 3 in 
Fig ure 1 and has, for example, an inte nde d destination for one of the rem ote stations in one 
or other of the groups accessed by means of the gateways, the network address (EP) from the 
header part of the packet is employed in a hashed form to look up a pointer in the hash table 
16a, the pointer pointing to an entry in the associated data table 1 7a in which the entries each 
comprise at least the network address of a remote station, the media access control address 
of a gateway by way of which the remote station can be accessed, and a port mask identifying 
the port to which the intermediate gateway is connected. In general, in a practical network 
more complex than that shown in Figure 1 there may be different paths to a remote station by 
way of different gateways. 

For the sake of example, Figure 3 is annotated to show the storage of entries relating to three 
network addresses, IP A. IPB and IPC, identifying the remote stations A, B and C in the group 
4 connected to switch 10 by way of gateway 1. The respective pointers in hash table 16a point 
to entries in the data table 1 7a. An entry identifying remote station A will have the network 
address (IP A), the layer 2 or media access control address (MACG1) of the gateway 1, and 
an identification of the port (port mask 1) to which gateway 1 is connected. 

Currently, when a packet is received by the switch from a local station and has an intended 
destination of one of the remote stations, the network address (IP) from the header part of the 
packet is employed in a hashed form to look up a pointer in a hash table 16a, the pointer 
pointing to an entry in an associated data table 1 7a in which the entries each comprise at least 
the network (IP) address of a remote station, the media access control address of the 
respective gateway to which the remote station is connected and a 'port mask', identifying the 
port to which the intermediate gateway is connected. 

Typically the hashed addresses are formed by performing arithmetic operations on the network 
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address. The operation may comprise an XOR function such as a folded XOR function. The 
operation reduces the width of the network address from n bits to m bits, where m < n. The 
hash table is normally a sparsely populated table; the more sparsely populated it is, the greater 
the probability of a unique match for a given hash function, i.e. a single location in an 
asso ciated data t able. Since how ev er different ne t work addresse s could be h ashe d t o the sam e 
result, the entry to which a pointer points must be checked to verify a match. If there is no 
match in a first location in the associated data table pointed to by the pointer, there will be a 
linked list of other possible matches each of which may have to be searched to find a correct 
match. 

As may be seen from Figure 3, there is an entry in the associated data table for every remote 
station. These entries are necessarily wide because network addresses and media access 
control addresses are typically wide (for example 32 bits and 48 bits respectively), and in 
practice for network addresses and media access control addresses of this width, the data table 
needs to be 128 bits wide. Further, since the media access control address of a gateway 
connected to a multiplicity of remote stations is the same for each of the entries in the data 
table of those stations, there is a high degree of redundancy of the entries within the associated 
data table. 

Figure 4 illustrates an improved technique for managing the controlling database in a switch 
of this nature. The technique can reduce required data space and avoid the need to verify 
hashed addresses. 

In the scheme shown in Figure 4, an incoming data packet having a network address (IP A, 
IPB, IPC etc) is caused to generate an entry in the hash table 16 (if it be a new address) and 
to provide access to an address pointer in the hash table if such entry is already there. The hash 
table contains a multiplicity of entries each of which contains the network address of a remote 
station and an address pointer which points to the entry in the associated data table 17 
containing the media access control address (e.g. MACG1) and the port mask defining the 
intermediate device (the gateway) to which the remote station is connected and, respectively, 
the port to which the associated gateway is coupled. 
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It may be noticed that the associated data table does not include the network address of the 
remote station. Further, those remote stations which share the same gateway have entries in 
the hash table including pointers pointing to the common entry in the associated data table- 
Thus there is only one data entry in the associated data table for all remote stations accessed 
5 through the respective common gateway. 

Each entry in the hash table comprises a validity bit 161, an IP address 162, an address pointer 
163 to a location in the associated data table and a link pointer 164, as shown for the first entry 
in the table 16. The hash table is accessed by hashing an IP address of the incoming packet 
10 (or part of the IP address) to develop a pointer to a location in the hash table. 

=r 2 Since hashing normally reduces a relatively wide address to a relatively narrow pointer (e.g. 

RJ from 32 to 16 bits) it is normally desirable to link entries in the hash table by means of link 

m pointers so as to ensure that all the relevant entries will be examined. Further, a hash 

515 generator typically has a width (e.g. 16 bits) somewhat narrower than the address which is 

hashed and a multi-stage hashing process would usually be employed. Figure 5 illustrates a 
O typical look-up process employing a database organised according to Figure 4. 

Hi The IP address (or part of it) at stage 50 is hashed (stage 51) to develop an address pointer 

J20 AddrPntr (stage 52), which may be formed from the 'hash result' and an offset, hashBasePntr. 

The address pointer is used to read the respective location in the hash table (stage 53). The 
contents of that location are latched (stage 54) to allow a test of the validity bit (stage 55) and 
a test for an IP address match (stage 56). If the validity bit is '0', the location is empty (i.e. 
there is no valid entry) and there is no match (stage 47) between the hashed address and the 
25 table 16. Normally such a negative result would be followed by a process of learning the IP 

address (stage 58) and the creation of an entry in the hash table, in the customary manner of 
building look-up tables. If the entry is valid, the validity bit being T, a check is made for 
identity between the IP address in the respective packet and the IP address in the hash table 
entry. If the entry contains the correct IP address, the entry in the associated data table to 
30 which the pointer points is read (stage 59) and the result (the MAC address and port mask) 

is fed to the switching engine (stage 60). If however the IP addresses do not match, the link 
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pointer of the examined entry is used as the address pointer (stage 61) and the stages 53 to 56 
repeated until an EP address match is found. 

It may be noted that in some circumstances it is not essential that the full IP address be used, 
especially when the remote stations have EP addresses which have a degree of similarity. In 
particular, it is feasible to perform a progressive look-up (known as a trie search) on an IP 
address. In such a scheme the last search stage comprises a table of partial IP addresses which 
may be hashed to access the associated data table. Thus the term 'network address' is 
intended to embrace address data having the character but not necessarily the totality of a full 
network address. 
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